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Abstract 

Although norm-referenced tests have been a mainstay in special 
education placement and program evaluation activity, there never has 
been a systematic c-ffort to determine whether handicapped children 
have been included In the development of the tests' norms, items, and 
indices of reliability and validity. The present investigation 
explored this question by analyzing up-to-date user manuals and 
technical supplements of 27 aptitude and achievement tests reviewed by 
Salvia and Ys5eldyl<e <1978)c Results indicated most test developers 
and publishers provide scant data on the appropr i -^teness of their 
tests for use with handicapped children. Implications for test users, 
developers, and publishers are discussed. 
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Norm-Referenced Tests! Are They Valid for Use 
with Handicapped Children? 

Recently there has been increased use of criterion-referenced 
tests. Nevertheless, norm-referenced measures, including IQ, 
achievement, and screening instruments, continue to be employed widely 
in special education, providing data for diagnostic, classification, 
program placement, and evaluation decisions. Despite or perhaps 
because of their long-term use, norm-referenced tests havt* been the 
focus of much criticism. 

Special education teachers and administrators, parents and 
officers of the court, professional organizations responsible for 
establishing testing standards, and educational researchers have 
expressed a variety of concerns. Critics contend that <a) test content 
discriminates against certain subgroups of the population (e.g., Co3e 
4 Bruner, 1972} Gould, 1981} McClelland, 1973; Ogbu, 1978), <b) the 
technical characteristics of many tests are inadequate <e.g., 
Ysseldyke L Shinn, 1981), <c) typical testing procedures are biased 
against low SES examinees (Fuchs & Fuchs, in press), <d) many tests 
both fail to explore pupils' cognitive processes < Gallas it Sigel. 
1979) and to guide instruction (e.g.. Hunt, 1975; Jenkins & Pany, 
1978), (e) tests facilitate the use of pejorative labels <e.g., 
Reynolds & Balow, 1972} Scriuen 1975), which, in turn, negatively 
affect teacher judgments (e.g., Foster, Ysseldyke, & Reese, 1975} 
Palmer, 1983), and (f) examiners can be incompetent in selecting 
appropriate tests (e.g., Keogh, Kukic, Becker, McLoughlin, & Kukic, 
1975), inaccurate in administering and scoring tests (e.g., Bennett, 
1983), and imprudent in failing to base instructional recommendations 
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and decisions on available adsessment data (e*g*» Ysseldyke & 
Algozzine, 1982). 

Another important issue » one which has been discussed much 
less frequently, is whether test constructors employ handicapped 
children in the development of test norms, items, and indices of 
reliability and validity. This concern draws much of its legitimacy 
and importance from recent federal legislation that demands an end to 
the discrimination of the handicapped and encourages their 
participation in the mainstream of American life. 

Both PL94-142 and Section 504 of the Rehabilitation Act of 
1973 recognize that testing may discriminate against the handicapped. 
Such recognition is implicit in Section 504, which establishes 
placement procedures to ensure that "children are not misclassif led, 
unnecessarily labeled as handicapped, or Incorrectly placed because of 
inappropriate selection, administration, or interpretation of 
evaluation materials" (42 Fed. Reg . 22691, cited In Sherman & 
Robinson, 1982). To eliminate the use of placement procedures that 
discriminate against the handicapped. Section 504 requires that tests 
be validated for the specific purpose for which they are used. In 
other words, they should be selected and administered so that, when 
given to handicapped students, results accurately reflect pupils' 
aptitude, achievement, or whatever other factors the tests purport to 
measure rather than reflecting the children's disabilities (except, of 
course, when the specific disabilities are the factors that the tests 
claim to measure). In short, before tests may be considered 
nondiscriminatory, or unbiased estimates of handicapped children's 
potential, they should measure the same abilities and skills among 
handicapped and nonhandicapped groups. 
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Contributing to the seriousness of this concern is the 
apparent complexity involved in the development of tests that are 
valid for use with both handicapped and nonhandicapped groups. As 
described by the National Research CounciTs Pane! on Testing 
Handicapped People (Sherman & Robinson, 1982), major modifications in 
the medium and method of test administration, as well as changes in 
test content, often are necessary before many handicapped children may 
be evaluated in a valid manner. For example, visually impaired 
children collectively may require a variety of modifications in the 
medium in which a test is administered: It may be necessary to produce 
test booklets in large print, high quality regular print, or braiMe, 
to tape record the test, or to read it aloud to the child. Th« method 
used to record a response also may require alteration: Visually 
impaired examinees who cannot record their answers may need the 
assistance of a tape recorder, typewriter, braille writer, or person 
who v^rltes or marks the answerrs. 

Likewise, changes in test content may be required. Items may 
be unnecessarily difficult (i.e. discriminatory) for a visually 
Impaired child if they measure knowledge, skills, or concepts learned 
primarily through vision or if the items use visual stimuli to measure 
knowledge acquired through other senses. Further complicating the 
formulation of valid tests for the handicapped is that the 
modifications developed for appropriate evaluation of visually 
impaired children may be qualitatively different from modifications 
necessary for the accurate testing of hearing impaired pupils} that 
is, children with contrasting handicapping conditions may require 
substantively different changes in the medium and method of test 
administration and test content. 
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Given the importance and apparent difficulty of developing 
normative tests that measure the same abilities and skills among 
h.xndicapped and nonhandi capped children^ the obvious question is, "Do 
they?" An important step toward answering this question is to explore 
test developers' user manuals and technical supplements to determine 
whether, and if so to what extent, handicapped children participated 
in the development of widely used normative tests. Although there have 
been occasional assertions that handicapped children have been 
excluded from most normative tests' development (Bennett, 1983} Jones, 
1973} Sherman & Robinson, 1982), we know of no systematic empirical 
attempt to determine the degree of participation of handicapped 
children in the creation of well known and widely used tests' norms, 
items, and indices of reliability and validity. This was the primary 
purpose of the present study. 

Method 

Procedure 

The study's procedure included four basic steps: selecting 
tests, contact ing publ ishers for current and complete technical 
information, analyzing the technical data, and determining interrater 
agreement . 

Selecting tests. Table 1 lists the titles, authors, 
publishers, and copyright dates of the 27 tests included in the 
analysis. These represent all but one o-^ the tests reviewed by Salvia 
and Ysseldyke <1978) in the following categories and sub-categories: 
(a) norm-referenced achievement tests, both screening and diagnostic, 
and <b) individually administered IQ tests, including global measures, 
picture vocabulary tests yielding IQ scores, and IQ tests designed for 
specific populations These tests were selected, and presumably were 
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reviewed by Salvia and Ysstldyke, because they are well known and 
widely used in special education. 



Insert Table 1 about here 



Supporting this contention are data from a survey conducted by 
6oh, Tesla^, and Fuller <1981) of a geographically representative 
sample of members of the National Association of School Psychologists. 
Respondents were asked to name the eight tests they used most often in 

each of a number of assessment areas. Among the tests reviewed by J 

Salvia and Ysseldyke, and included in this study, 14 measures were 

mentioned by the psychologists! 6, 4, and 4 in the areas of 

Intelligence, Achievement-General, and Achievement-Specific, 

respectvely. In a similar investigation, Mardel 1 -Czudnowsk i (cited in 

LaGrow (< Prochnow-LaGrow, 1982) surveyed one-fifth of all licensed 

full-time school psychologists eir^ployed in Illinois in 1980. The 

survey asked respondents to" identify the most frequently used tests in 

special education. The five aptitude or achievement tests named most 

often are among the 27 tests included in this study. 

Contacting publishers. The publishers of each test were 
contacted for two reasons: to determine whether we had the most recent 
user manual and to explore whether additional technical information 
was available in published supplements, bulletins, or handbooks. If we 
had out-dated or incomplete test iniormation, we ordered and obtained 
the most recent user manual and all published supplementary technical 
inf ormat i on • 

Analyzin g technical data. The user manual and /or technical 
supplement of every test was analyzed in terms of norms, item 
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development, internal and test-retest reliability, and concurrent and 
predictive validity. A matrix was constructed to include the 27 tests 
(rows) and 6 psychometric characteristics (columns), yielding 162 
cells. For each test and every psychometric property, we inquired 
about handicapped children's inclusion. Uhen no data were presented 
for a psychometric characteristic of a given test, the corresponding 
cell in the matrix was left blank. If it was stated that handicapped 
children participated in the development of a psychometric property, 
but no percentage of involvement was specified, a "Yes" was recorded. 
When the percentage of handicapped children ''s involvement was made 
clear, the specific percentage was entered. Finally, if ihe test 
manual and technical supplement were unclear about handicapped 
children's participation, a question marl< (?) was recorded. 

Determining interrater agreement. Interrater agreement was 
determined using the formula suggested by Coulter (cited in Thompson, 
White, & Morgan, 1982): Percentage of agreement = agreements between 
raters A and B/agreements 4 disagreements between raters A and B 4 
omissions by rater A ^ omissions by rater B. Calculated on 100% of the 
data entered in the matrix, interrater agreement was .84. 
Disagreements were resolved through discussion between the two raters. 

Resul ts 

Table 2 displays the matrix described above. Two distinctive 
characteristics of the matrix are its numerous question marks and 
blank spaces. Each of these features represents important and related 
facts about the 27 popular tests. 
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Insert Table 2 about here 



Question Marks 

The question marks aro oi two types: those with and without 
-footnotes. As discussed above, those wi thou t footnotes signify the 
failure of test developers to clarify whether they included 
handicapped children In the standardization populations, development 
of test items, and establishment of indices of reliability and 
validity. Many of the question marks wi th footnotes indicate the 
existence of technical data generated on handicapped children, but 
this information was not produced by the test developers. Instead, 
these data typically were produced by researchers uninvolved in test 
development. These researchers frequently employed small and 
homogeneous groups of children, who were not part of any normative 
population, and sometimes administered the tests in ways different 
from the procedures used by the test developers. Because of the size 
and nature of these groups and the sometimes incomparable ways In 
which test developers and researchers administered the tests, we 
believe the technical data referenced by the footnotes may be 
inadequate and potentially misleading. Additionally, since some of 
this information is not readily a»)ailable to most test users, we 
believe it does not always represent a useful data base. Thus, for the 
majority of footnoted question marks in the matrix, we see no reason 
to distinguish them from the question marks without footnotes. 

Regarding the question marks in an undifferentiated manner 
indicates that they constitute 52M of the matrix <or 84 of 162 cells). 
Although no test developer explicitly excluded the handicapped from 
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the norms, Item development, or establishment of indices of 
reliability and validity, only 5 of 27 tests provide specific 
percentages of handicapped children's participation in some aspect of 
test development. These five tests account for a mere 12 of 162 cells, 
or TA of the matrix, displaying specific percentages of handicapped 
children's Involvement. In three additional cells <2>{ of the matrix), 
item development of the McCarthy Scales of Children's Abilities and 
Slosson Intelligence Test and norms of the Silent Reading Diagnostic 
Test, a "yes" is displayed, indicating handicapped children were 
Involved but at an unspecified level. This widespread vagueness 
extends even to tests specifically designed for or widely used with 
special populations. Among IQ tests for special populations shown in 
Table 2, only the Blind Learning Aptituoe Test and Nebraska Tost of 
Learning Aptitude provide specific percentages of handicapped 
children's participation In test development. 
Blank Spaces 

As with the question marks, some blank spaces in Table 2 are 
footnoted and some are not. Those without footnotes mark instances in 
which test constructors do not provide users with any technical 
information at all. Many of the blank spaces with footnotes indicate 
technical data exist, but were not produced by the test developers. We 
believe much of these data may be inadequate and misleading for many 
of the same reasons discussed with respect to the footnoted question 
marks. 

Viewing the blanks without regard to footnotes indicates that 
many of the developers of the 27 popular tests offer scant technical 
data: The Stanford-Bine t fails to report indices of internal or 
test-retest reliability cr of concurrent or predictive validity} the 
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Lei ten presents no data on the nature of its norms, item development, 
internal and test-retest reliability, or :;oncurrent and predictive 
validity} the Peabody Picture Vocabulary Test-Revised, Metropolitan 
and Stanford Achievement Tests, Uide Range Achievement Test-Revised, 
Gray Oral Reading Test, Silent Reading Diagnostic Test, and Woodcock 
Reading Mastery Tests have neither concurrent nor predictive validity 
information; and the Quick Test does not provide pertinent reliability 
data. For a psychometric property such as predictive validity, only 3 
of the 27 tests possess pertinent data. 

Across all \62 cells of the matrix, 63 "ells (39X) are blank. 
Combining these 63 cells with the 84 additional ones containing 
question marks, reveals that, for H7 of 162 cells <?1X of the 
matrix), test developers provide no information on the nature and 
extent of handicapped children's involvement in test development. 

Pi scuss i on 

Findings indicate that, among 27 widely used normative tests, 
numerous test constructors provide no evidence to indicate their tests 
are valid for use with handicapped children. A majority of cells in 
the matrix displayed in Table 2 contain question marks, indicating 
widespread failure among test developers to clarify the population on 
which they developed their instruments. It is possible test 
constructors employed handicapped children (without identifying them 
as handicapped) more frequently than our findings indicate. However, 
until many of them describe more completely the children on whom thty 
have developed their tests, they and their publishers are in explicit 
violation of widely recognized professional standards for test 

development. ', 
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The Standards for Educational and Psychological Testing 
(Standards ; American Educational Research Association $ American 
Psychological Association, & National Council on Measurement in 
Education, 1985) call for clear and comprehensive descriptions of 
those employed in the development of test norms (see Standard 4.3, p. 
33), items (see Standard 3.12, p. 28), and indices of reliability (see 
Standard 2.2, p. 20) and validity (see Standard 1.5, p. 14). If, in 
fact; test constructors have not validated their instruments for use 
with handicapped people, they "... should issue cautionary statements 
in manuals and elsewhere regarding confidence in interpretations ..." 
based on these tests (Standard 14.2, p. 79). Moreover, if such "... a 
test is likely to be used incorrectly for certain kinds of decisions, 
specific warnings against such use should be given" (Standard 1.2, p. 
13). In our analysis of the manuals and technical supplements of the 
27 popular tests, we rarely found admonishing qualifications in the 
information describing the tests* intended uses. 

Underscoring the need for test developers to provide data on 
the use of their instruments with handicapped people is empirical 
evidence indicating typical test procedures discriminate against 
certain handicapped children's performance. In a series of studies, 
Fuchs and associates have shown that speech- and language-impaired 
children do not perform optimally when tested by an examiner with whom 
they are unfamiliar (see Fuchs, Feathers tone > Garwick, & Fuchs, 1984; 
Fuchs, Fuchs, Dailey, & Power, 1985; Fuchs, Fuchs, Garwick, & 
Featherstone, 1983). Most recently^ Fuchs, Fuchs, Power, and Dailey 
(1985) demonstrated that, although speech- and language-handicapped 
children perform significantly less well with unfamiliar than with 
familiar examiners, nonhandicapped children perform equally well in 
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both condi t ions* 

These findings have important practical implications because 
of two apparent features of many normative tests. First, an average 
normative score probably reflects the mean performance of 
nonhandicapped children* Second, during the development of the norms, 
tests typically are administered by unfamiliar examiners <Fuchs, 
1981). The research of Fuchs, Fuchs, Power, and Dai ley <1985) 
indicates the use of unfamiliar examiners in the norming process does 
not depress the performance of the nonhandicapped standardization 
population. Hoi'^ever, when speech- and language-impaired children are 
assessed by unfamiliar examiners, their suboptimal performance is 
compared to the more optimal performance of the nonhandicapped 
normative group. In such comparisons, examiner unf ami 1 i ar i ty is a 
source of systematic error or bias that invalidates test-related 
diagnostic, classification, and program placement decisions. 

Documenting the important fact that many test developers do 
not provide validity information on the use of their instruments wi th 
handicapped children does not absolve test users from the 
responsibility to employ appropriate tests and procedures wi th this 
group. Rather, according to the StandardSt the absence of this 
information incr ^ases users' obligations to test handicapped pupils in 
a manner that does not discriminate selectively against them. On page 
42. Standard 6.3 states, "When a test is to be used for a purpose for 
which it has not been previously validated, or for which there is no 
supported claim for validity, the user is responsible for providing 
evidence of val id! ty" (italics added). Because the Standards often are 
a guide in litigation, special education teachers, school 
psychologists, speech clinicians, and other test users should 
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recognize the potential liability in administering tests without 
pertinent vaMdity information. Such recognition, however, may not be 
widespread. Evidence suggests that at least some examiners tend "to 
select tools in a routinized manner without consideration for the 
purposes of assessment, an instrument's technical adequacy for those 
purposes, or even the basic descriptive information contained in 
Instrument manuals" (Bennett, 1983). 

As mentioned at the outset, validating tests for the 
handicapped is a complex process. For test constructors and publishers 
this complexity no doubt represents additional expenditures of effort 
and money (see Sherman & Robinson, 1982). Whereas some "bottom line" 
test developers and publishers may regard validating their tests on 
the handicapped as a project they cannot afford to do, we view such an 
effort as something they "cannot afford not to do." Tests without 
validation data on the handicapped simply cannot and should not be 
used with this group for purposes cf determining diagnoses, 
classifications, program placements, and evaluations of academic 
progress. 

This Is not only our view. It is the position expressed in 
PL94-142 and Section 504 of the Rehabilitation Act of 1973} it runs as 
a motif through the Standards! and, we believe, it represents the 
sentiments of many handicapped children and their families, teachers, 
and testers. Fortunately, some private Institutions have made 
important contributions toward validating some tests for the 
handicapped. The American Printing House for the Blind, for example, 
produces and distributes large print and braille versions of many 
norm-referenced tir5ts <W. Duckworth, personal communication, October 
25, 1985). With respect to the 1982 edition of the Stanford 
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Achievement Test, Gallaudet College has modified testing procedures 
and established norms specifically for hearing-impaired children (see 
Allen, White, & Karchmer, 1983). Additionally, it is encouraging to 
note that at least some test developers and publishers have attempted 
to validate their tests for use with the handicapped (see Table 2), 
and that others, such as Educational Testing Service, are In the 
process of doing so (see Bennett & Ragosta, 1984). 
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Table 1 

Titles, Authors, Publishers, and Copyright Dates 
of 27 Norm-Referenced Tests 







Author (s) of manual 








or technical 




Type of test 


Text name/abbreviation 


suppl ement/copy r1 ght 


Publ 1 sher 




McCarthy Scales of Children's Abilities 
(MSCA) 


McCarthy (1972) 


Harcourt Pracs Jovanovlch 




Slosson Intelligence Test (SIT) 


Slosson (1982)1 


Slosson Educational 

Puhl 1 Crtti nnt 


General 

IQ 


Stanford Binet Intelligence Scale (Binet) 

Wechsler Adult Ingalllgence Scale-Revised 
(WAIS-R) 


Terman 4 Merrill 
(1973) 

Wechsler (1981) 


Riverside 

Harcourt Brace Jovanovlch 




Wechsler Intelligence Scale for Children- 
Revised (WISC-R) 


Wechsler (1974) 


Harcourt Brace Jovanovlch 




Wechsler Preschool and Primary Scale 
for Intelligence (WPPSI) 


Wechsler (1967) 


Harcourt Brace Jovanovlch 


IQ tests 
for 


Blind Learning Aptitude Test (BLAT) 


Newland (1969) 


Harper and Row 


special 
populations 
and Picture 
vocabulary 
tests 


Columbia Mental Maturity Scale (CMMS) 

Leiter International Performance Scale 
(Leiter) 


Burgeny»1ster, 
Blum, ct Lorge 
(1972) 

Levine (1983)2 


Harcourt Brace Jovanovlch 
Stoelting 
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Table 1 (continued) 



Type of test. 



Test name/abbreviation 



Author(s) of manual 

or technical 

suppl ement/copyri ght 



Publisher 



IQ tests 

for 
special 
populations 
and Picture 
vocabulary 
tests 



Nebraska Test of Learning Aptitude (NTLA) 

Pictorial Test of Intelligence (PTI) 

Peabody Picture Vocabulary Test-Revised 
(PPVT-R) 

Quick Test (QT) 



Hiskey (1966) 

French (1964) 

Dunn & Dunn 
(1981) 

AinnonSo& Anmons 
(1962)3 



Marshall S- Hiskey 
Houghton-Mifflin 
American Guidance Service 



Psychological Test 
Specialists 



ERLC 



Achievement 
screening 
tests 
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CaMfornia Achievement Tests, Forms 
C and D (CAT) 

Gates-MacGinitie Reading Tests (GMRT) 

Metropolitan Achievement Tests Survey 
Battery (MAT) 

Peabody Individualized Achievement Test 
(PIAT) 

Stanford Achievement Test (SAT) 



Wide Range Achievement Test-Revised 
(WRAT-R) 



CTB /McGraw-Hill 
(1979)^ 

MacGinitie (1981)5 

Prescott, Balow, 
Hogan, i Farr (1978) 

■^unn 4 Markwardt 
970) 

Harcourt ^ 
Jovanovich (1984)° 

Jastak & Wilkinson 
(1984) 



CTB /McGraw-Hill 
Riverside 

Harcourt Brace Jovanovich 



American Guidance 
Service 

Brace Harcourt Brace Jovanovich? 



Jastak Associates 
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Table 1 (continued) 



Type of test 



Test name/abbreviation 



Author(s) of manual 

or technical 

suppi enent/copy r1 ght 



Publisher 



Achievement 
diagnostic 
tests 



Diagnostic Reading Scales (DRS) 

Durrell Analysis of Reading Difficulty 
(Durrell) 

Gil more Oral Reading Test (Gil more) 



Gray Oral Reading Test (Gray Oral) 
Reading Diagnostic Tests (RDT) 

Silent Reading Diagnostic Tests (SRDT) 



Stanford Diagnostic Reading Test/Red, 
Green, and Brown levels, (bDRT) 



CTB/McGraw-Hil 1(1982)7 

Durrell S Catterson 
(1980) 

Gil more i Gil more 
(1968) 

Gray (1967) 

Gates, McKillop, & 
Horowitz (1981) 

Bond, Balow, & Hoyt 
(1976) 

Karl son. Madden, & 
Gardner (1976) 



CTB/McGraw-Hill 
Harcourt Brace Jovanovich 

Harcourt B.*ace Jovanovich 
Pro-Ed 

Teachers College 
Meredith 

Harcourt Brace Jovanovich 



Woodcock Reading Mastery Tests (WRMT) 



Woodcock (1973) 



American Guidance Service 



nechnical manual. 

^Levine, M. N. (1983). Leiter international performance scale: A handbook. Chicago: Stoelting. 
"'Ammons, R. B., i Ammons, H. S. (1962). The Quick Test (QT): Provisional manual. Psychological Reports, 
n, 111-161. 
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%B/McGraw Hill (1979). The California Achievement Tests; Technical bulletin No. 1. Monterey, CA: CTB/McGraw- 
Hill • — — 

^MacGinitie, W. H. (1981). Technical summary. Chicago: Riverside. 

^Harcourt Brace Jovanovich (n.d.). Technical review kit for the Stanford Achievement Series. New York: Harcourt 
Brace Jovanovich. 

^CTB/hcGraw-Hin. (1982). Diagnostic Reading Scales: Technical report. Monterey, CA: McGraw-Hill. 
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Table 2 

Handicapped Children's Participation In the 
Technical Development of 27 Nona-Referenced Tests 



Reliability 



Validity 



Item 



Type of test 


Test name 


Norms 


development 


Internal 


Test-re test 


Concurrent 


Predictive 




MSCA 




Yes^ 


? 


? 


? 


? 




SIT 




Yes* 






24%^ 




General . 


Blnet 


? 


? 








_ 


IQ 


WAIS-R 


7' 




? 


? 


8 


8 


tests 


WISC-R 


7' 





? 


? 


10 







WPPSI 


? 




? 


? 


? 




IQ tests 


BLAT 




loox • 


lOOZ 


100% 


100% 




for 


CMMS 


? 


? 


? 


? 




12 


special 


Lelter 






13 


13 


13 


13 


populations 


NTLA 




100% 


100% 


• 


15 




and 


PTI 


? 


? 


? 


? 




17 


Picture 


PPVT-R 


? 


? 










vocabulary 
tests 


QT 


? 













Table 2 (continued) 
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I 

Reliability Validity 



Item 



Type of test 


Test name 


Norms 


development 


Internal Test-retest 


Concurrent 


Predictive 




CAT 


? 


? 


? ? 


? 





Achievement 


GMRT 


? 





? ? 


? 





screening 


MAT 


? 


? 










tests. 


PIAT 


? 


? 


? 


? ^" 







SAT 


? 


? 





? 





- 


WRAT-R 







? ? 


22 






DRS 


23 


? 


? ? 


,24 


,25 




Durrell 


? 


? 






,26 


Achievement 


Gilmore 


? 


? 




?27 




diagnostic 


Gray Oral 


,28 


? 








tests 


RDT 


? 




? 


? 






SRDT 


Yes 


? 










SDRT 


? 


? 




? 






WRMT 


? 


3%30 


? ? 







^Definition of "normal" seems to permit the possibility that mildly and moderately 
handicapped children were included (see page 16 of the manual). 
'On page 1 it is stated that mentally retarded children were involved, but no 
percentages or numbers are presented. 
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On page 21 of the manual it is reported that, among 1,109 standardization 
subjects, 266 were retarded with IQs less than 84 (x • 71.90, SD « 7.47). 
*0n page 3 it is stated that mentally retarded children and institutionalized adults 
were employed in item try-outs, but no percentage or number is given.* 

The standardization sample was not divided into ability groups for purposes of 
determining test stability (see page 33), 

^All 1,109 individuals in the normative group also completed the Binet. Also, 
concurrent validity studies employing the Binet, HISC, and HISC-R are reported on 
pages 55, 57, 58, and 60. 

'information on page 18 of the manual precludes the involvement of severely 
handicapped adults; it does not rule out the participation of mildly and moderately 
handicapped individuals. 

On pages 49-50 reference is made to validity studies conducted by others, which 
presumably did not involve the standardization sample. 
^The definition of "normal" does not preclude the involvement of mildly and 
moderately handicapped children (see page 19 of the manual). 
^^Concurrent validity studies employing the WAIS, HPPSI, and Binet are reported on 
pages 48-51. Only the group of children tested on the Binet were part of the 
standardization population. 

^^All subjects were blind; there is no mention of the possibility of additional 

handicaps (see page 2 of the manual). 
12 

On pages 42-47 of the manual there is an annotated bibliography of r^:liability and 
validity studies involving cerebral palsied, deaf, speech-impaired, mentally 
retarded, and neurologically-impaired children and either the 1954 or 1959 editions 
of the test. 

^"'on pages 78-86 and 89-95 of the manual, reliability and validity studies are 
reported for handicapped individuals. Kone of these studies appeared to employ 
individuals constituting the standardization population. 



Norm-Referenced-27 

l^On page 8 of the manual, it is stated that item try-outs were conducted with deaf 
and mentdlly retarded children. (It is unclear whether, and if so to what extent, 
the normative population of deaf children included pupils who also were mentally 
retarded. ) * 
^^A concurrent validity stuir was conducted on hearing children constituting the non- 
handicapped portion of the normative sample. An investigation involving students at 
the Iowa School for the Deaf is cited as evidence for the test's concurrent validity 
with deaf children (see pages 13-14). 

^^On pages 20-21 of the manual three concurrent validity studies are reported, one of 
which included 21 mentally retarded children. 

^^On pages 19-20, several predictive validity investigations are reported. These 
studies- involved subjects ranging in number from 9 to 28 who were not members of the 

standardization population. 
18 

On pages 56-58 of the manual a literature review of approximately 100 studies is 

presented, an unspecified numoer of which employed "mentally retarded or 

institutionalized children." 
19 

Additional concurrent validity studies are reported on pages 128-131 of the manual. 

It is unclear whether any of these investigations included handicapped individuals. 
20 

On page 51 of the manual, the findings from an additional concurrent validity study 

are reported, which was conducted on 46 mentally retarded children. 
21 

This number refers to the proportion of mentally retarded children included in the 
normative sample (see page 23 of the manual). It is unclear whether mildly and 
moderately handicapped pupils participated in the standardization. 
22no indices of concurrent validity are reported for the standardization population. 
On page 63 of the manual, summarized results of other studies are presented, although 

no mention is made of handicapped children's participation. 

23 

On page 3 of the Technical Report it is stated thai the children "were drawn from 
classes indentified as average. Highly accelerated students, students with severe 
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reading disabilities, and students who were unable to function In the English 

language were excluded from the sample." This description of participants does not 

ellmlntate the possibility that mildly or moderately handicapped children 

participated In the norming process. t 

^*0n page 9-10 of the Technical Report, additional concurrent validity studies are 

reported, none of which appears to have Included handicapped children. 

^^Several Investigations of predictive validity are described on page 11 In the 

Technical Report. One stu(Hy employed 62 "retarded readers." 

2^0n page 57 In the manual It Is claimed that the predictive validity of several 

subtests "has been studied extensively." However, there are no citations for these 

studies, no description of the criterion measures used, and no Indications of the 

types af children Included In the Investigations. 

27 

On pages 26-27 of the manual several additional concurrent validity studies are 
reported, none of which makes clear the possible participation of handicapped 

children. 

28 

On page 25 of the manual It Is reported that, "Pupils were eliminated who were 
known to have serious health or emotional problems. Those who stuttered or had other 
serious speech defects were not Included. Any pupil who had been double-promoted or 
who had repeated a grade was considered unsuitable. Otherwise, random selection 
procedures were followed." 

2Q 

^"Whole classes were tested to obtain the normative data, except for samples of 
remedial reading cases totaling 300 pupils" (see page 14 of manual). From this 
passage It Is unclear whether, and If so to what extent, handicapped children 

participated In this standardization effort. 

30 

On page 43 of the manual It Is stated that, among 36,527 pupils Involved In the 
effort to calibrate the difficulty of the test, 1,224 were mentally retarded and In 
special classes. It Is unclear whether additional malnstreamed handicapped children 
participated In the test's development. 
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